Not End-to-End: Explore Multi-Stage Architecture for Online Surgical Phase Recognition

Abstract

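Surgical phase recognition is of particular interest to computer assisted surgery systems, in which the goal is to predict what phase is occurring at each frame for a surgery video. Networks with multi-stage architecture have been widely applied in many computer vision tasks with rich patterns, where a predictor stage first outputs initial predictions and an additional refinement stage operates on the initial predictions to perform further refinement. Existing works show that surgical video contents are well ordered and contain rich temporal patterns, making the multi-stage architecture well suited for the surgical phase recognition task. However, we observe that when simply applying the multi-stage architecture to the surgical phase recognition task, the end-to-end training manner will make the refinement ability fall short of expectations. To address the problem, we propose a new non end-to-end training strategy and explore different designs of multi-stage architecture for the surgical phase recognition task. For the non end-to-end training strategy, the refinement stage is trained separately with the proposed two types of disturbed sequences. Meanwhile, we evaluate three different choices of refinement models to show that our analysis and solution are robust to the choices of specific multi-stage models. We conduct experiments on two public benchmarks, the M2CAI16 Workflow Challenge and the Cholec80 dataset. The SOTA comparable results show that multi-stage architecture holds great potential to boost the performance of existing single-stage models. Code is available at https://github.com/ChinaYi/NETE.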
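As a rough illustration of the predictor-then-refinement pipeline the abstract describes, the sketch below chains two toy stages over per-frame phase scores. It is not the paper's method: `predictor_stage` and `refinement_stage` are hypothetical names, the argmax predictor and the causal majority vote are simple stand-ins for the learned networks, and the scores are made-up values. The only property it aims to show is that the second stage consumes the first stage's output and uses only current and past frames, as an online setting requires.

```python
from collections import Counter
from typing import List, Sequence


def predictor_stage(frame_scores: Sequence[Sequence[float]]) -> List[int]:
    """Stage 1 (stand-in): pick the highest-scoring phase for each frame."""
    return [max(range(len(s)), key=s.__getitem__) for s in frame_scores]


def refinement_stage(initial: Sequence[int], window: int = 5) -> List[int]:
    """Stage 2 (stand-in): causal majority vote over the last `window` labels.

    Only the current and earlier frames are read, so it can run online.
    """
    refined = []
    for t in range(len(initial)):
        recent = list(initial[max(0, t - window + 1): t + 1])
        counts = Counter(recent)
        best = max(counts.values())
        # Break ties in favor of the most recent label among the tied ones.
        for label in reversed(recent):
            if counts[label] == best:
                refined.append(label)
                break
    return refined


# Made-up scores over 3 phases; frame 3 is a spurious jump to phase 2.
scores = [
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.0],
    [0.7, 0.3, 0.0],
    [0.2, 0.1, 0.7],
    [0.6, 0.4, 0.0],
    [0.3, 0.7, 0.0],
    [0.2, 0.8, 0.0],
    [0.1, 0.9, 0.0],
]
initial = predictor_stage(scores)
refined = refinement_stage(initial, window=3)
print(initial)  # [0, 0, 0, 2, 0, 1, 1, 1]
print(refined)  # [0, 0, 0, 0, 0, 1, 1, 1]
```

In this toy run the refinement stage removes the isolated phase-2 prediction while keeping the later transition to phase 1. Because the two stages are separate functions, the second can be fitted or tuned on its own inputs, which loosely mirrors the idea of training the refinement stage separately.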

Journal

Journal title: Lecture Notes in Computer Science

Year: 2023

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-031-26316-3_25